Improved Accuracy In Data Mining Decision Tree Classification Using Adaptive Boosting (Adaboost)

نویسندگان

چکیده

The Decision Tree algorithm is a data mining method that often applied as solution to problem for classification. C5.0 has several weaknesses, including: the and other decision tree methods are biased towards modeling whose features have many levels, some problems model can occur such over-fit or under-fit challenges, big changes logic result in small training, experience inconvenience, imbalance causes low accuracy algorithm. boosting an iterative gives different weights distribution of training each iteration. Each iteration adds weight examples misclassification decreases correct classification, thereby effectively changing data. One example adaboost. purpose this research improve performance classification using adaptive (adaboost) predict hepatitis disease Confusion matrix. Tests been carried out with Matrix use Hepatitis dataset which rate 80.58% error 19.15%. Whereas Adaboost higher 82.98%, 17.02%. This difference caused by adaboost algorithm, because able change weak classifier into strong increasing observations, also reduce rate.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Modelling Customer Attraction Prediction in Customer Relation Management using Decision Tree: A Data Mining Approach

In Today’s quality- based competitive world, known as knowledge age, customer attraction is of ultimate importance. In respect to the slogan “customer is always right”, customer relation management is the core of an organizational strategy playing an important role in four aspects of customer identification, customer attraction, customer retaining, and customer satisfaction. Commercial organiza...

متن کامل

the clustering and classification data mining techniques in insurance fraud detection:the case of iranian car insurance

با توجه به گسترش روز افزون تقلب در حوزه بیمه به خصوص در بخش بیمه اتومبیل و تبعات منفی آن برای شرکت های بیمه، به کارگیری روش های مناسب و کارآمد به منظور شناسایی و کشف تقلب در این حوزه امری ضروری است. درک الگوی موجود در داده های مربوط به مطالبات گزارش شده گذشته می تواند در کشف واقعی یا غیرواقعی بودن ادعای خسارت، مفید باشد. یکی از متداول ترین و پرکاربردترین راه های کشف الگوی داده ها استفاده از ر...

data mining rules and classification methods in insurance: the case of collision insurance

assigning premium to the insurance contract in iran mostly has based on some old rules have been authorized by government, in such a situation predicting premium by analyzing database and it’s characteristics will be definitely such a big mistake. therefore the most beneficial information one can gathered from these data is the amount of loss happens during one contract to predicting insurance ...

15 صفحه اول

Generalization and Decision Tree Induction: Efficient Classification in Data Mining

Efficiency and scalability are fundamental issues concerning data mining in large databases. Although classification has been studied extensively, few of the known methods take serious consideration of efficient induction in large databases and the analysis of data at multiple abstraction levels. This paper addresses the efficiency and scalability issues by proposing a data classification metho...

متن کامل

Land Cover Classification Using IRS-1D Data and a Decision Tree Classifier

Land cover is one of basic data layers in geographic information system for physical planning and environmentalmonitoring. Digital image classification is generally performed to produce land cover maps from remote sensing data,particularly for large areas. In the present study the multispectral image from IRS LISS-III image along with ancillary datasuch as vegetation indices, principal componen...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Sinkron : jurnal dan penelitian teknik informatika

سال: 2023

ISSN: ['2541-2019', '2541-044X']

DOI: https://doi.org/10.33395/sinkron.v8i2.12055